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System and Method for Detecting Dirty Data Fields 

FIELD OF THE INVENTION 

This invention relates generally to documents and objects having multiple data fields, in 
which the data in those data fields can be modified by a user. More specifically, this invention 
relates to a system and method of indicating when changes to the data in those data fields have 
occurred. 

BACKGROUND OF THE INVENTION 

Information in documents or objects capable of accepting input from computing application 
users can be used for viewing, updating, and creating data, where the data is stored in one or more 
data storage devices. More specifically, it is common to store data received as input from such users 
in one or more databases residing on such data storage devices. When changes initiated by 
application users are to be made to the data through such documents or objects, it is necessary to 
update the data in the databases to reflect these changes. 
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Some documents capable of accepting input from computing application users include a 
single form having multiple data fields. For example, a Hypertext Markup Language (HTML) page 
which includes an HTML form may be used for the purpose of displaying data to a user through a 
web browser, and accepting changes to the data as input. Values assigned to those data fields may 
be retrieved from one or more databases and displayed to the user. The user can then change the data 
in the HTML page by assigning a new value to certain data fields in the HTML form as specified 
by the user. 

Other documents or objects capable of accepting input from computing application users may 
have a hierarchical structure, in which the main document or object may consist of several 
sub-documents or objects. Each sub-document or object may also consist of several sub-documents 
or objects. A sub-document or object may have multiple data fields. Values assigned to those data 
fields may be retrieved from one or more databases and displayed to the user. The user can then 
change the data by assigning a new value to certain data fields as specified by the user. 

Typically, when changes are made to the data in a document or object (e.g. an HTML page), 
the data in associated databases must be updated to reflect those changes. One technique is to update 
the values of all data fields (i.e., whether or not the data has been changed) of the document in the 
databases, by overwriting the value of each data field as stored in the databases with the current 
value of the corresponding data field in the document or object. At any given time, the current value 
of a data field reflects the most recent changes made, if any, to the value of that data field. However, 
this technique can be inefficient, particularly for documents or objects comprising a large amount 
of data where the data of only a few data fields may have been changed by the user. 

Other strategies for updating data in a database may be used, however, many of these other 
strategies require the identification of which specific data fields in the document or object have been 
changed. 

U.S. Patent Number 5,317,730 discloses a system and method for dynamically processing 
a list of data objects retrieved from a database. A "cut & paste filter" ("C&PF") permits users to 
work with lists of data objects in an object-oriented computing system. The C&PF is capable of 
displaying lines of data on a screen and permitting changes to be made to the data without actually 
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committing those changes to the database. Until the user commits the changes to the database, the 
lines of data displayed by the C&PF are considered to be "dirty" (i.e., modified). 

The system and method described in U.S. Patent Number 5,3 1 7,730 do not permit specific 
data fields which have had their values changed by a user to be marked as "dirty". The system and 
method described in U.S. Patent Number 5,3 1 7,730 also do not relate to the handling of "dirty" data 
in documents or objects comprising one or more levels of sub-documents or other objects, or data 
update techniques for such documents. 

U.S. Patent Number 5,768,51 1 discloses a system and method for managing objects in a 
networked client-server computing system. An application program creates and stores data in the 
form of objects in a database. When a window or panel in the client is to use an object stored in the 
database, a "partial" object comprising a number of selected attributes which contain data is created 
in the client. For each of these attributes, the object includes a "clean" field containing data as stored 
in the database, and a "dirty" field containing any changes to that data which have yet to be saved 
in the database. Data in the "dirty" field can be used to update the existing objects in the database. 

The system and method described in U.S. Patent Number 5,768,5 1 1 do not teach the creation 
of a record containing entries indicating only which data fields in a document are "dirty", to be sent 
back to the server with the data to be saved in the database. The system and method described in 
U.S. Patent Number 5,768,51 1 also define multiple fields for each attribute in the created "partial" 
object. This can be inefficient particularly for large objects having many attributes, and where the 
data of only a small number of those attributes have changed in value. 

U.S. Patent Number 5,864,849 discloses a system and method for restoring a multiple 
checkpointed database. A checkpoint is a copy of a database stored apart from the database itself. 
A "dirty page' table is associated with each checkpoint to keep track of records that have been 
changed or updated since the most recent update of the database. 

The system and method described in U.S. Patent Number 5,864,849 are directed towards the 
restoration of corrupted databases and do not relate to the updating of data in databases based on 
information obtained from a user through documents such as HTML pages. The system and method 
described in U.S. Patent Number 5,864,849 also do not relate to the handling of "dirty" data in 
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documents or objects comprising one or more levels of sub-documents or other objects, or data 
update techniques for such documents or objects. 

U.S. Patent Number 5,974,238 discloses a system and method for communicating data 
records between a desktop computer and a handheld computer, specifically data records relating to 
calendars, telephone directories, to-do lists, and the like. The system may also provide for a file 
viewer or browser to allow for the viewing of files of particular types of common applications 
including word processor, spreadsheet and database files. The system also provides for tags which 
are capable of indicating whether certain records are "dirty" (i.e. modified). 

The system and method described in U.S. Patent Number 5,974,238 do not permit specific 
data fields in a document which have had their values changed by a user to be marked as "dirty". 
The system and method described in U.S. Patent Number 5,974,238 also do not relate to the 
handling of "dirty" data in documents or objects comprising one or more levels of sub-documents 
or other objects, or data update techniques for such documents or objects. 

Accordingly, there is a need for a system where specific data fields in a document which have 
had their values changed by a user can be marked as "dirty" to facilitate the updating of databases 
containing values for those data fields. There is also a need for system and method of handling 
"dirty" data in documents or objects comprising one or more levels of sub-documents or other 
objects. 



SUMMARY OF THE INVENTION 

The present invention relates to a system and method for indicating when changes to values 
of data fields in a document have occurred. 

One aspect of the present invention is a method of indicating when changes to values of data 
fields in a document have occurred, comprising the steps of storing in at least one first object initial 
values of the data fields in a document, initially storing in at least one second object the same values 
as stored in the first object and subsequently modifying the values in the at least one second object 
to reflect the most current values of the data fields in a document, comparing the values stored in the 
first and second objects to determine which data fields are "dirty", creating records identifying the 
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"dirty" data fields, and transmitting the records to a server for updating databases which store data 
for the data fields in the document. 

Another aspect of the present invention relates to a system for indicating when changes to 
values of data fields in a document have occurred comprising at least one first object for storing the 
5 initial values of data fields in a document, at least one second object for storing the current values 
of those data fields, and a module for comparing the values in the first and second objects, for 
determining which data fields are "dirty", and for creating records that indicate which data fields are 
"dirty". 

It will be appreciated by those skilled in the art that the invention can be embodied in a 
1 0 computer program which can be stored in storage or transmitted as a signal, such as on a modulated 
carrier signal for use in a computer system, or on a network such as the Internet. 

BRIEF DESCRIPTION OF THE DRAWINGS 

m_ For a better understanding of the present invention, and to show more clearly how it may be 

carried into effect, reference will now be made, by way of example, to the accompanying drawings 
1 W which show a preferred embodiment of the present invention, and in which: 
1 Figure 1 is a schematic diagram illustrating the present invention; and 

Figure 2 is a flowchart illustrating the steps performed in the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In the claims and in the specification, a notebook is defined as a Graphical User Interface 
20 (GUI) object that allows information to be displayed using a set of one or more panels, where each 
panel contains related information. Each panel comprises multiple data fields. Values assigned to 
those data fields can be displayed and subsequently changed by a user by assigning a new value to 
the appropriate data fields. Since a notebook can comprise numerous such panels, the data in a 
notebook can be changed by a user by modifying the values of data fields in the notebook's various 
25 panels. 

Furthermore, in the specification and in the claims, a data field is considered to be "dirty" 
if the data field has a current value which differs from its initial value. The initial value of a data 
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field in a document or object is the value of the data field as stored in the databases at the time the 
document or object is opened or created. The current value of a data field in a document or object 
is the value of the data field in the document, reflecting the most recent changes made, if any, to the 
value of the data field by a user. 

More generally, an object is considered to be "dirty" if the object consists of at least one 
"dirty" data field. By determining which data fields in a document or object have become "dirty", 
different database update strategies may be used. 

For example, some database update strategies for data in a notebook may include: 

(i) Dirty field update: 

only "dirty" fields in a notebook are updated 

(ii) Dirty panel update: 

only the "dirty" and non-"dirty" fields of "dirty" panels are updated 

(iii) Dirty notebook update: 

only the "dirty" and non-"dirty" fields of a "dirty" notebook are 
updated 

(iv) Optimistic locking algorithm: 

the non-"dirty" fields of a notebook are compared with their 
corresponding values in the databases and the "dirty" fields of the 
notebook are updated only if the compared values are the same; 
this algorithm may be used in multi-user environments to improve 
the integrity of data in databases 

The present invention relates to a system and method for indicating when changes have been 
made by a user to the values of data fields in a document or object. Put another way, the system and 
method of the present invention allows for the identification of "dirty" data fields in a document or 
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object. This permits different update strategies including those discussed above to be used in 
updating the databases in which data associated with the document or object is stored. 

Referring to Figure 1 , a system for indicating when changes to values of data fields in a 
document have occurred is shown generally as 1 0. An instance of a notebook 20 containing one or 
5 more panels 30 is displayed on a user's screen 40 which is connected to a client personal computer 
(PC) 50. The panels 30 of the notebook instance 20 may be used to display data to a user as dictated 
by an application module 60 running on the client PC 50. However, the application module 60 need 
not reside on the client PC 50 and may be running on any other computing machine connected to 
the client PC 50. 

10 As mentioned earlier in this specification, a notebook 20 is a GUI object that allows 
information to be displayed through one or more panels 30, where each panel 30 contains related 

~S information. A user can view the data in a panel 30 and switch between panels 30 by selecting a tab 
!; jf associated with each of the panels 30 or by another means as will be known to those skilled in the 
In art. Typically, only one panel 30 is selected for display and editing of data at any one time. 

1 1 j The data to be displayed in the notebook instance 20 is stored in one or more databases 80 
vy residing on a server 90 connected to the client PC 50 by a network connection 95. Each panel 30 
H ; of the notebook instance 20 contains multiple data fields 100. Each data field 100 can display a 
fy value and/or allow the user to change the value of the data field 1 00. The input fields may include 

text fields, check boxes, radio buttons, etc. For example, consider a notebook instance 20 

2#* identified as "contact" for displaying information about contacts. Assume the "contact" notebook 
contains three panels 30, the first panel identified as "general", the second panel identified as 
"address", and the third panel identified as "numbers". Also assume that the "general" panel 
contains three data fields 100 identified as "name", "age", and "birthdate", that the "address" panel 
contains five data fields 100 identified as "street", "city", "province", "country", and "postal code", 
25 and that the "numbers" panel contains three data fields 100 identified as "workphone", 
"homephone", and "fax". 
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The data associated with an instance of a notebook 20 might be the following, as represented in 
XML format: 



<?xml version = "1.0" encoding = "iso-8859-l"?> 
<contact> 
<general> 

<name>Gerry Kovan</name> 
<age>28</age> 

<birthdate>0 1/01/1 972</birthdate> 

</general> 
<address> 

<street>123 Anywhere Street</street> 
<city>Toronto</ city> 
<province>Ontario</province> 
<country>Canada</country> 
<postalcode>M3C lW3</postalcode> 

</address> 
<numbers> 

<workphone>4 16-123- 1 234</workphone> 
<homephone>4 1 6-999-0000</homephone> 
<fax>4 1 6- 1 23-0000</fax> 

</numbers> 
</contact> 

Assume in this example, that a user opens this instance of a "contact" notebook 20 and changes the 
information associated with the following data fields 100: age, street, postalcode, workphone, and 
fax. The data fields 100 that have changed are considered to be "dirty". 

In accordance with the present invention, a first object 110 and a second object 115 are 
created by the application module 60 for each panel 30 associated with a notebook instance 20 at the 
time the notebook instance 20 is created. Each first object 1 10 and second object 115 created is 
stored in a memory or storage device 120. The memory or storage device 120 may be, more 
specifically, a disk, cache memory, conventional RAM or other memory or storage device as known. 

The first object 110 associated with a specific panel 30 stores values corresponding to the 
initial state of the data associated with data fields 100 of the specific panel 30, upon the opening of 
the notebook instance 20 by the application module 60. The second object 1 1 5 associated with the 
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specific panel 30 initially also stores values corresponding to the initial state of the data associated 
with data fields 1 00 of the specific panel 3 0, but is subsequently updated whenever changes are made 
to the data in the specific panel 30 by a user. The second object 115 associated with the specific 
panel 30 is used to retain the most current values of the data fields 100 in the specific panel 30, with 
the values in that second object 115 reflecting any changes made to the data associated with those 
data fields 100 in the specific panel 30. 

In accordance with the present invention, one or more dirty data field records 130 created by 
the application module 60 comprise a list of data fields 100 identified as "dirty". For example, the 
dirty data field records 130 may comprise a list of identifiers of data fields 100, each of which is 
assigned a boolean value of "true" to indicate whether the data field 1 00 identified is "dirty". A data 
field 1 00 is considered "dirty" if the current value of the data field 1 00 differs from the initial value 
of the data field 100. This can be determined for each panel 30 in a notebook instance 20 by 
comparing the data associated with the data field 1 00 stored in the first object 1 1 0 and second object 
1 15 for the specific panel 30. 

When a user submits the data in a notebook instance 20 for processing, the application 
module 60 sends to the server 90 the dirty data field records 130 along with the data in the notebook 
instance 20. The dirty data field records 1 30 can then be used by a data update module 140 on the 
server 90 to update the data in the databases 80 with data received from the user through the panels 
30 of the notebook instance 20. 

According to the example provided above with respect to the "contact" notebook, the data 
sent to the server 90 when the data in that notebook instance 20 is submitted for processing can be 
represented in XML, incorporating the contents of the dirty data records 130 as follows: 

<?xml version = "1.0" encoding = "iso-8859-l"?> 

<contact> 
<general> 

<name>Gerry Kovan</name> 
<age>30</age> 

<birthdate>01/01/1972</birthdate> 
<dirtyData> 

<age>true</age> 
</dirtyData> 
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</general> 
<address> 



<street>789 Anyplace Road</street> 
<city>Toronto</ city> 
<province>Ontario</province> 
<country>Canada</country> 
<postalcode>M2C lX3</postalcode> 
<dirtyData> 

<street>true</street> 

<postalcode>true</postalcode> 
</dirtyData> 



<workphone>4 1 6- 1 23 -4444</workphone> 
<homephone>4 1 6-999-0000</homephone> 
<fax>416-555-0000</fax> 
<dirtyData> 

<workphone>true</workphone> 

<fax>true</fax> 
</dirtyData> 



</address> 
<numbers> 



</numbers> 
</contact> 



The present invention thus permits the data update module 1 40 and other application modules 
to determine whether data received from a user has been changed by that user at different levels, for 



instance: 



(hi) 



(ii) 



at the field level : is a field dirty? 

at the panel level: does a panel contain at least one field that is 
dirty? 

at the notebook level : does a notebook contain at least one panel 
that is dirty 
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This facilitates the use of many different database update strategies to be applied by a data update 
module 140 or other application modules in updating data in databases 80. 

Referring to Figure 2, a method of indicating when changes to values of data fields in a 
document have occurred is shown as a series of steps commencing at step 150. 

At step 1 60, a user requests that an instance of a notebook 20 be opened. The data associated 
with the notebook instance 20 is then displayed on a screen 40. The notebook instance 20 is created 
by an application module 60, which in the preferred embodiment, resides on a client PC 50. 
Application module 60 initially selects a panel 30 to be the currently selected panel for display on 
the screen 40. 

At step 1 65, application module 60 determines whether a first object 100 and a second object 
1 15 already exist for the currently selected panel. If so, the flow of method steps proceeds to step 
190, otherwise the flow of method steps proceeds to step 170. 

At step 1 70, application module 60 creates a first object 1 1 0 and a second object 1 1 5 for the 
currently selected panel 30 in the notebook instance 20, and retrieves data associated with the data 
fields 100 on the currently selected panel 30 from databases 80 located on a server 90. The values 
of the data fields 100 as retrieved from the databases 80 are stored in both the first object 1 10 and 
the second object 1 1 5 for the selected panel 30. These values represent the initial values of the data 
fields 100 of the selected panel 30. 

At step 180, application module 60 loads the currently selected panel 30 with data in the 
corresponding first object 110. 

At step 190, optionally, a user changes the value of a user-specified data field 100 in the 
currently selected panel 30 to a new value. The current value of the user-specified data field 1 00 is 
thus equal to this new value. 

At step 200, the data in the second object 1 1 5 associated with the currently selected panel 30 
is modified to reflect the change in the value of the user-specified data field 1 00. This can be done 
by overwriting the old value of the data field 100 as stored in the second object 115 with the new 
value. 
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As indicated at step 210, if further edits are to be made to the data in the currently selected 
panel 30, the flow of method steps proceeds back to step 190 where such changes can be made. 
Otherwise, the flow of method steps proceeds to step 220. 

At step 220, a user may optionally select another panel 30 to view by selecting a tab 
associated with the other desired panel 30 (or by other selection means as known). If a user selects 
a different panel 30, the flow of method steps proceeds back to step 165, and the selected panel 30 
becomes the currently selected panel 30. Otherwise, the flow of method steps proceeds to step 230 
at which data in the notebook instance 20 is submitted by the user for processing. 

At step 240, the application module 60 compares the data values of each of the data fields 1 00 
in each panel 30 as stored in the first object 1 10 and the second object 1 15 of each panel 30. 

At step 250, the application module 60 determines which data fields 100 are "dirty" for each 
panel 30; that is, the application module 60 determines which data fields 100 have a current value 
(as stored in the second object 1 1 5) that is not equal to its initial value (as stored in the first object 
1 10) for each panel 30. 

At step 260, the application module 60 creates a set of dirty data field records 130 that 
identify which data fields 100 are "dirty" as determined at step 250. 

At step 270, the application module 60 transmits to the server 90 the dirty data field records 
130 identifying data fields 100 for each panel 30 that are "dirty", along with current values of the 
data fields 100 for each panel 30 as stored in the second object 1 15 of each panel 30. 

Depending on the specific data update strategy used, it may be necessary to only transmit the 
current values of "dirty" data fields to the server 90 at step 270. 

Preferably, only "dirty" data fields are identified in the dirty data field records 130 so that 
step 270 may be performed more efficiently, particularly when the number of data fields 100 in a 
notebook instance 20 is large. 

At step 280, the dirty data field records 130 may be used by a data update module 140 to 
update the databases 80 with the data as submitted by the user through the panels 30 of the notebook 
instance 20. 

Step 290 marks the end of the method of indicating when changes to values of data fields in 
a document have occurred. 
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In variant embodiments of the invention, first objects 110 and second objects 115 for all 
panels 30 of the notebook instance 20 can be created in the same method step after the notebook 
instance 20 is opened. Furthermore, in variant embodiments of the invention, all panels 30 of the 
notebook instance 20 may be loaded with data (e.g. from the second object 1 15 of each of the 
5 respective panels) in the same method step after the first objects 1 1 0 and second objects 1 1 5 for the 
panels 30 of the notebook instance 20 have been created and initialized with data from the databases 
80. 

The present invention may be particularly useful in Internet applications. The present 
invention may be used in a web-based notebook implementation. By providing a means to track 
10 whether data fields of a document or object are "dirty", different data update strategies may be 
implemented. In particular, data update strategies geared towards maintaining data integrity in 
Q multi-user environments may also be implemented using the present invention. 
'% In a variant embodiment of the invention, in a web-based implementation of the invention, 

) ; ; a Hypertext Markup Language (HTML) page containing an HTML form may be used to accept input 
IS; J from the user. The data to be displayed in the HTML form is stored in one or more databases 
residing on a server. The HTML form can contain multiple data fields, each of which can display 
f a value and/or allow the user to change the value of the data field. For example, an HTML form may 
t=- contain data fields relating to the following: 

l§ (i) a name, which can be represented in a text field; 

20= (ii) an age range, which can be represented using radio buttons (e.g. a 

button for each age group: age 10-18, age 1 9-35, age 36+, etc.); and 
(iii) an email notification switch, which can be represented by a 

checkbox indicating whether or not the user is to receive e- mail 
messages. 

25 In accordance with the present invention, as applied to this variant embodiment, the HTML 

document must also define and create objects that reflect the information in the HTML form. This 
object can be defined and created using an application module for executing scripts coded in 
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browser-based scripting languages such as Javascript or Visual Basic Script, for example. An 
example of a JavaScript definition of such an object for the HTML form may be: 

<SCRIPT> 

5 function (name, age, emailNotify) 

{ 

this.name = name; 
this. age = age; 

this.emailNotify = emailNotify; 

10 } 

</SCRIPT> 

Two such objects are created in accordance with the present invention. As in the preferred 
: embodiment of the invention, the objects are used to create dirty data records for updating the data 
W in associated databases. Essentially, this variant embodiment of the invention is a special case of 
1 5 - the preferred embodiment, if the notebook instance 20 of the preferred embodiment only contains 
f : one panel 30. In this variant embodiment of the invention, the HTML page replaces the notebook 
f= instance 20, and the HTML form replaces the single panel 30. Minor modifications to the system 
CO 1 0 may be made to provide hardware or any other requirements necessary for this web-based 
j™ implementation of the invention. The method of the present invention may also be similarly applied 
20 to this variant embodiment. 

In the specification and in the claims, a document is any object capable of presenting data and 
accepting data as input, or any object that comprises obj ects capable of presenting data and accepting 
data as input. This includes HTML pages, notebooks, or objects containing one or more instances 
of these documents/objects, for example. 
25 With respect to the elements of the system 10 described in this specification, it will be 

apparent to those skilled in the art that the execution of various tasks need not be performed by the 
particular component specified in the description of the preferred embodiment of the invention. It 
will also be apparent to those skilled in the art that components of the systems 1 0 need not reside on 
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a specific computing machine or device as described in the specification, and need not be 
implemented in the specific manner as described in the specification. For example, the components 
of the system 10 may physically reside on a single computing device, or may be distributed across 
multiple computing devices. Data stored in databases may be stored in a single database, or 
distributed across several databases or other storage means. The connections (e.g. 95) of system 10 
can be maintained by any data communication means as is known, including a Ethernet network 
connection, a TCP/IP connection, wireless communication means, or other known connection 
means. The tasks performed by application module 60 or data update module 140 maybe performed 
by multiple modules, or a module different than that suggested in the specification according to the 
preferred embodiment of the invention. 

As will be apparent to those skilled in the art, other variations, modifications and adaptations 
of the systems and methods described herein are possible without departing from the present 
invention, the scope of which is defined in the claims. 
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